Signal permuting

ABSTRACT

A method of constructing a permuting network. A configuration for layers of a permuting network is selected based on a set of integer factors of N, the number of signals to be permuted, and on pre-selected types of switches. The permuting network is constructed in layers by using the pre-selected types of switches based on the selected configuration.

TECHNICAL FIELD

[0001] This invention relates to signal permuting.

BACKGROUND

[0002] Signal permuting is used to change the routing paths of signals to connect two interfaces whose sets of signal lines have different orderings. For example, as shown in FIG. 1A, if an interface 10 to input circuitry 11 has lines Lo to L₃ that carry Control, Data 1, Data 2, and Ground signals, and a second interface 12 to output circuitry 13 has lines L₀ to L₃ that carry Data 1, Data 2, Ground, and Control signals, the input and output interfaces are coupled by a 4-by-4 permuting network 14 maps [L₀, L₁, L₂, L₃] to [L₃, L₀, L₁, L₂].

[0003] A permuting network can be implemented using switches. In FIG. 1B, the output of an n:1 switch 16 can reflect any one of its n inputs. By using a number of n:1 switches which receive the same set of n signals at their inputs, n signals can be permuted to any order at their output terminals. FIG. 1C shows a 2-by-2 permuting network implemented using two 2:1 switches 20, 22 that permutes the ordering of two signals S₀ and S₁.

[0004] An n:1 switch can be implemented using an n:1 multiplexer, a 2^(n)×1 random access memory that has its contents written to reflect a single address input, or an n-input lookup-table in a field programmable gate array that is programmed to reflect any one of its inputs at its output. For each type of technology, the types of switches that can be most economically implemented may be different.

DESCRIPTION OF DRAWINGS

[0005] FIGS. 1A-1C are schematic diagrams of switches and permuting networks.

[0006] FIGS. 2-3 are schematic diagrams of permuting networks.

[0007]FIG. 4A is a diagram of a mapping between input and output terminals.

[0008]FIG. 4B is a diagram showing transition of position coordinates of signals.

[0009]FIG. 5A is a schematic diagram of a computer having a permuting network.

[0010]FIG. 5B is a diagram of a mapping between the input and output terminals of the permuting network of FIG. 5A.

[0011]FIG. 6 is a schematic diagram of a permuting network.

DETAILED DESCRIPTION

[0012] The invention is directed towards construction of a permuting network using a predetermined set of building blocks. The building blocks are programmable switches of one or more type, each type being able to select one signal from among several signals. The permuting network is configured as a multidimensional switch array permuter (MSAP) that can permute the ordering of signals by programming the switches to select signals according to a permuting algorithm.

[0013] In the figures, each permuter has input terminals at its left and output terminals at its right. Referring to FIG. 2, a multidimensional switch array permuter (MSAP) 200 that can permute N=ω₁×ω₂× . . . ×ω_(D) signals is implemented using 2D−1 layers of switches, layers L₁ through L₂D−1. The numbers ω₁, ω₂, . . . ω_(D) are integers. Each layer contains the same number (N) of switches of the same type, but the types of switches may differ among different layers. For λ=1 to D, layer L_(λ) includes ω_(λn):1 switches, each of which can select one signal from among ω_(•) signals. For example, layer L₁ contains ω₁:1 switches, layer L₂ contains ω₂:1 switches, and layer L_(D) contains ω_(D):1 switches. For λ=D+1 to 2D−1, layer L_(λ) contains (2D−λ):1 switches. For example, layer L_(D+1) contains ω_(D−1):1 switches, and layer L_(2D+1) contains ω₁:1 switches. By appropriately connecting the output terminals of switches in one layer to the input terminals of switches in the next layer, and appropriately programming the switches to select input signals, MSAP 200 can permute the ordering of a set of signals at its input into any arbitrary ordering of the signals at its output.

[0014] The input signals of the switches in layer L₁ are the N input signals of the MSAP. The ω₁:1 switches in the layer L₁ are configured to form a number of ω₁-by-ω₁ permuters 202, the number being N/ω₁. Each ω₁-by-ω₁ permuter 202 includes a number of ω₁:1 switches, the number being ω₁. Each ω₁by-ω₁ permuter 202 receives ω₁ signals from the input terminals 204, and outputs a set of re-ordered ω₁ signals on the output terminals 206. The output signals 206 of permuters 202 are sent to the input terminals of permuters in layer L₂. The ω₂:1 switches in layer L₂ are configured to form a number of ω₂-by-ω₂ permuters 208, the number being N/ω₂. Each ω₂-by-ω₂ permuter 208 receives ω₂ signals at its input terminals 210, and outputs a set of re-ordered ω₂ signals at its output terminals 212. The output signals of permuters 208 are sent to the input terminals of permuters in layer L₃, and so forth.

[0015] In general, the switches in layer L_(λ)(λ=1 to D) are grouped into ω_(λ)-by-ω_(λ) permuters, each permuter receiving ω_(λ) signals from the previous layer and generating a set of re-ordered ω_(λ) signals at its output. The switches in layer L_(λ)(λ=D+1 to 2D−1) are grouped into ω_(2D−λ)-by ω_(2D−λ) permuters, each permuter receiving ω_(2D−λ) signals from the previous layer and generating a set of re-ordered ω_(2D−λ) signals at its output. The switch outputs in the last layer, L_(2D−1), are the outputs of the MSAP, which is a set of re-ordered N signals.

[0016] The connections between the output terminals of one layer and the input terminals of the next layer are fixed (e.g., hardwired). The connections are represented by the regions 214 enclosed in dashed lines. The connections between each layer are configured so that each of the output signals of a permuter is sent to a different permuter in the next layer. Thus, each ω_(λ)-by ω_(λ) permuter receives ω_(λ) input signals from ω_(λ) different permuters in the previous layer, permutes the order of the ω_(λ) signals, and sends the re-ordered ω_(λ) signals to ω_(λ) different permuters in the next layer.

[0017] An advantage of the invention is that the MSAP can be designed and constructed using various types of switches, depending on which types are available. For example, to design an MSAP that can permute N signals, the factors we of N are first determined. Usually the number N can be factored in many ways, each of which corresponds to a different design. For example, suppose the factors ω_(i) are chosen so that N=ω₁×ω₂× . . . ×ω_(D), then a MSAP having D dimensions, with ω_(λ) being the width of the λ-th dimension can be designed. The meaning of “width” and “dimension” will be apparent from later descriptions.

[0018] For example, there are several MSAP configurations that can permute 48 signals:

[0019] A two-dimensional MSAP can be designed by choosing N=48, D=2, ω₁=6, and ω₂=8. The 48-by-48 MSAP can be built by using a first layer of 6:1 switches, a second layer of 8:1 switches, and a third layer of 6:1 switches.

[0020] A three-dimensional MSAP can be designed by choosing N=48, D=3, ω=3, ω₂=4, and ω₃=4. The 48-by-48 MSAP can be built by using a first layer of 3:1 switches, second, third, and fourth layers of 4:1 switches, and a fifth layer of 3:1 switches.

[0021] A five-dimensional MSAP can be designed by choosing N=48, D=5, ω₁32 2, ω₂=2, ω₄=2, and ω₅=3. The 48-by-48 MSAP can be built using first through fourth layers of 2:1 switches, a fifth layer of 3:1 switches, and sixth through ninth layers of 2:1 switches.

[0022] The advantage is apparent when different environments (e.g., different technologies for implementing the switches, or different brands of products of the same technology) provide different types of switches, or allow different types of switches to be implemented most economically. For example, in one environment, the basic building block may be 2:1 switches, and all other n:1 switches are constructed from the 2:1 switches. In another environment, the basic building blocks may include both 2:1 and 3:1 switches, and larger switches are built from these two types of switches. An N-by-N MSAP may be constructed using building blocks that are the most economical for that particular environment.

[0023] Following the previous example, to permute 48 signals, it is also possible to select a configuration of MSAP in which N is chosen to be larger than 48, so that some MSAP inputs and outputs are left unused. This structure uses more logic circuits, but may allow certain MSAP configurations to be available. For example, by choosing N=64, the following three configurations can be used to construct a 64-by-64 MSAP:

N=64, D=2, ω₁=8, ω₂=8.

N=64, D=3, ω₁=4, ω₂=4, ω₃=4.

N=64, D=6, ω₁=2, ω₂=2, ω₃=2, ω₄=2, ω₅=2, ω₆=2.

[0024] In general, for a particular environment, if larger switches have to be built from smaller switches, then to permute the same number (N) of signals, a higher dimensional MSAP using smaller switches will use less logic circuitry. In the above example, if the 8:1 and 4:1 switches have to be built from 2:1 switches, then among the three configurations, the six-dimensional MSAP using 2:1 switches is more economical (i.e., will use the smallest number of switches overall). Because the dimensional width ω_(k) is proportional to the sizes of switches, smaller switches and less logic circuitry can be used by choosing ω₁, ω₂, . . . ω_(D) as close together in value as possible. For example, when N=n^(D), the most economical configuration is to choose ω₁=ω₃= . . .=ω_(D)=n. For example, if larger switches have to be constructed from 2:1 switches, then the most efficient design for MSAP is to choose n=2 and use 2:1 switches as the basic building blocks of the MSAP. See Table 1 below for a comparison of the efficiency of MSAP's constructed using switches of various sizes.

[0025] In the example shown in FIG. 3A, a 12-by-12 MSAP 300 is constructed from two layers of 3:1 switches and one layer of 4:1 switches. MSAP 300 includes layers L₁ to L₃. Layer L₁ includes 3:1 switches that are configured to form 3-by-3 permuters 302, 304, 306, and 308. Layer L₂ includes 4:1 switches that are configured to form 4-by-4 permuters 310, 312, and 314. Layer L₃ includes 3:1 switches that are configured to form 3-by-3 permuters 316, 318, 320, and 322. Within each permuter, different switches select different input signals so that there is no overlap in their selections, resulting in a re-ordering of the input signals appearing at the output terminals of the permuter.

[0026] Each 3-by-3 permuter in layer L₁ receives 3 signals at its input terminals, permutes the ordering of the 3 signals, and sends the re-ordered 3 signals to three different permuters in layer L₂. For example, the first output terminal of permuter 302 is connected to the first input terminal of permuter 310. The second output of permuter 302 is connected to the first input of permuter 312. The third output of permuter 302 is connected to the first input of permuter 314, and so forth. In this way, each of the three output signals of a permuter in the layer L₁ will be re-ordered by a different permuter in the layer L₂.

[0027] Likewise, each output of a permuter in the layer L₂ is connected to an input of a different permuter in layer L₃. Thus, each of the four output signals of a permuter in layer L₂ will be re-ordered by a different permuter in layer L₃. By properly re-ordering signals using each permuters, the 12-by-12 MSAP can re-order 12 input signals into any orderings of the 12 output signals.

[0028] As an illustration of how permuters re-order signals, dashed lines are shown inside each permuter representing routing of signals from input terminals to output terminals. Twelve input signals with the ordering [[S₁, S₂, . . . S₁₂] are re-ordered into ordering [S₇, S₅, S₃, S₉, S₈, S₁, S₁₀, S₂, S₆, S₁₂, S₁₁, S₄] at the output of MSAP 300. The following illustrates how signal S₃ is re-ordered from line #3 at the input terminal at the left of the figure to line #6 at the output terminal at the right of the figure. Permuter 302 re-orders signal S₃ from input #3 to output #1, which is connected to input #1 of permuter 310. Permuter 310 re-orders signal S₃ from input #1 to output #2, which is connected to input #4 of permuter 318. Permuter 318 then re-orders signal S₃ from input #4 to output #6. The inputs and outputs of the permuters in each layer are numbered #1 to #12 from top to bottom in the figure.

[0029] The following describes the connections between the output terminals of switches in one layer and the input terminals of switches in the next layer. These connections are fixed (e.g., 1 5 hardwired). For description purposes, the switches in each layer of the MSAP are said to form a D-dimensional array. The switches in layer L_(λ) are denoted as S[λ][x₁, x₂, . . . x_(D)], with 1□x_(k)□ω_(k), k ranging from 1 to D. The number x₁ represents the 1^(st) dimension coordinate of the switch, the number X₂ represents the 2^(nd) dimension coordinate, and so forth. The inputs to these switches will be denoted by adding subscripts to the ends of switch names. For example, switch S[1][2,3,4] has inputs S[1][2,3,4]₁, S[1][2,3,4]₂,etc. The outputs will be donoted by adding an asterisk (e.g. S[1][2,3,4]*).

[0030] The output signals of the MSAP are the outputs of the last layer L_(2D−1), i.e., the MSAP outputs are S[2D−1][x₁, x₂, . . . , x_(D)]* for the various combinations of x₁, x₂, . . . x_(D). To simplify description of notation for the switches, a fictional layer L₀ is used so that the MSAP inputs are the L₀ outputs. Thus, the MSAP inputs are S[0][x₁, x₂, . . . , x_(D)]* for the various combinations of x₁, x₂, . . . , x_(D). Although there is no switch S[0][1, . . . , 1], there is a signal S[0]1, . . . 1]*, which is the MSAP input.

[0031] For λ=1,2, . . . , D, the permuters in layer L_(λ) arbitrarily permute the outputs of the permuters in the previous layer along dimension λ only. This means that the switches in a permuter in layer L_(λ) only receive signals from switches in the previous layer having labels that are different in dimension λ, and only permute those signals. Each of the ω_(λ:)1 switches in layer L₈₀ (λ□D), denoted S[λ][x₁,x₂, . . . , x_(D)], has ω_(λ) inputs, i.e., S[λ][x₁,x₂, . . . , x_(D)]₂, . . . , S[λ][x₁, x₂, . . . , x_(D)](ω_(λ)). These inputs map to all the outputs of the previous layer with switch coordinates differing only in dimension λ:

S[λ][x₁, x₂, . . . , x_(λ), . . . , x_(D)]₁=S[λ−1][x₁, x₂, . . . , 1, . . . , x_(D)]*

S[λ][x₁, x₂, . . . , x_(λ), . . . , x_(D)]₂=S[λ−1][x₁, x₂, . . . , 2, . . . , x_(D)]*

S[λ][x₁, x₂, . . . , x_(λ), . . . , x_(D)](ω_(λ))=S[λ−1][x₁, x₂, . . . , ω_(λ), . . . , x_(D)]*

(1□λ□D; any x₁, x₂, . . . , x_(D))

[0032] For λ=D+1, D+2, . . . , 2D−1, layer L_(λ) arbitrarily permutes the outputs of the previous layer along dimension 2D−λ only. This means that the switches in a permuter in layer L_(λ) only receive signals from switches in the previous layer having labels that are different in dimension 2D−λ, and only permute those signals. Each of the ω_(2D−λ):1 switches in layer L_(λ)(λ>D), denoted S[λ][x₁, x₂, . . . , x_(D)], has ω_(2D−λ) inputs, i.e., S[λ][x₁, x₂, x_(d)]₂, . . . , S[λ][x₁, x₂, . . . , x_(D)](ω_(2D−λ)). These inputs map to all the outputs of the previous layer with switch coordinates differing only in dimension 2D−λ:

S[λ][x₁, x₂, . . . , x_(2D−λ), . . . , x_(D)]₁=S[λ−1][x₁, x₂, . . . , 1, . . . , x_(D)]*

S[λ][x₁, x₂, . . . , x_(2D−λ), . . . , x_(D)]₂=S[λ−1][x₁, x₂, . . . , 2, x_(D)]*

S[λ][x₁, x₂, . . . , x_(2D−λ), . . . , x_(D)](ω_(2D−λ))=S[λ−1][x₁, x₂, . . . , x_(2D−λ), . . . , x_(D)]*

(D+1□λ1D−1; any x₁, x₂, . . . , x_(D))

[0033] Referring to FIG. 3B, the switches in MSAP 300 have been labeled as S[λ][x₁, x₂], λ ranging from 1 to 3, x₁ ranging from 1 to 3, and x₂ ranging from 1 to 4. For convenience of notation, the MSAP input signals are described as the output signals of layer L₀, and are labeled as S[0][i,j]*, i ranging from 1 to 3, and j ranging from 1 to 4. Layer L₁ permutes the signals along the first dimension. This means that the switches in layer L₀, whose outputs are connected to the inputs of a switch in layer L₁, have labels with different coordinates in the first dimension but same coordinates in the second dimension. For example, the three inputs of switch S[1][2,3] are connected to outputs of switches in layer L₀ as follows:

S[1][2,3]₁=S[0][1,3]*

S[1][2,3]₂=S[0][2,3]*

S[1][2,3]₃=S[0][3,3]*

[0034] Layer L₂ permutes the signals along the second dimension. This means that the switches in L₁, whose outputs are connected to the inputs of a switch in L₂, have labels with different coordinates in the second dimension but same coordinates in the first dimension. For example, switch S[2][2,3] has its four inputs connected as follows:

S[2][2,3]₁=S[1][2,3]*

S[2][2,3]₂=S[1][2,2]*

S[2][2,3]₃=S[1][3,2]*

S[2][2,3]₄=S[1][2,4]*

[0035] Layer L₃ permutes the signals along the first dimension. For example, switch S[3][2,3] has its three inputs connected as follows:

S[3][2,3]₁=S[2][1,3]*

S[3][2,3]₂=S[2][2,3]*

S[3][2,3]₃=S[2][3,3]*

[0036] The output signals of MSAP 300 are the 12 switch outputs from layer L₃, i.e., S[3][1,1]* through S[3][3,4]*.

[0037] An MSAP switch topology has a recursive structure. If the first and last layers of switches (L₁ and L_(2D−1)) are removed from an MSAP of dimension D>1, the remaining portion is a number of smaller MSAP's of dimension D−1, the number being ω₁. In general, in an MSAP with dimension D>1, for each fixed coordinate c between 1 and ω₁, in the first dimension, the subset of switches {S[λ][c, x₂x₃, . . . , x_(D)]}(λ ranging from 2 to 2D−2 and each x_(k) ranging from 1 to ω_(k)) forms a sub-MSAP of dimension D−1 with dimensional widths ω₂, ω₃, ω_(D). Each sub-MSAP has its own sub-MSAP's, and so forth, down to the level of one-dimensional MSAP's. This recursive structure of the MSAP is reflected in the MSAP switching algorithm described below.

[0038] The switches in an MSAP are configured according to a switching algorithm so that the MSAP can permute signals in any chosen order. To facilitate description of the algorithm, each input signal to the MSAP is described as having a starting position and a target position. The position of a signal in a particular layer is written as [x₁, x₂, . . . , x_(D)], and corresponds to the label of the switch that selected the signal as output. For example, in FIG. 3B, the signal at input line #3 has a starting position of [3,1] because the signal is selected by switch S[0][3,1] and reflected at its output. As described previously, layer L₀ is a fictional layer and used just for convenience of description. When the signal propagates to layer L₁, its position changes to [1,1] because the signal is selected by switch S[1][1,1]. When the signal propagates to layer L₂, its position changes to [1,2] because the signal is selected by switch S[2][1,2]. Then in layer L₃, the position of the signal changes to [3,2] because the signal is selected by switch S[3][3,2].

[0039] In a three dimensional (D=3) MSAP, suppose an input signal has a starting position of [2,5,3] and a target position of [4,1,6]. This means that the MSAP switches must be set in a manner so that the input signal S[0][2,5,3]* is routed to the output signal S[5][4,1,6]*. As a signal propagates through the MSAP, the signal's position is adjusted in each switch layer, reflecting the switches that the signal has propagated through. Because each layer only permutes the signals in one dimension, only one coordinate of the signal's position is adjusted in a layer. For example, in the D=3 MSAP, a signal may follow the position path:

L₀:[2,5,3]

L₁:[8,5,3]

L₂:[8,7,3]

L₃:[8,7,6]

L₄:[8,1,6]

L₅:[4,1,6]

[0040] This position path determines how one switch in each layer must be set. In order that the signal propagates from Lo position [2,5,3] to L₁ position [8,5,3], switch S[1][8,5,3] must be set to position 2. In order that the signal propagates from L₁ position [8,5,3] to L₂ position [8,7,3], switch S[2][8,7,3] must be set to position 5, and so forth. When all of the position paths for the MSAP's input signals have been determined, all of the switches in the MSAP can be set so that all input signals are routed to the proper output terminals of the MSAP. Thus, a collection of N position paths (or signal paths) can be used to determine a switch-setting configuration for an MSAP.

[0041] The MSAP Switching Algorithm

[0042] The MSAP switching algorithm provides a way of configuring an MSAP to route input signals to achieve every possible signal permutation at the output terminals. However, the configuration provided by the switching algorithm is not the only way that an MSAP can be configured to permute signals. It is possible to use a “brute-force” method by looking through all possible switching combinations in some intelligent (e.g., recursive) manner to find a solution.

[0043] The MSAP switching algorithm includes Algorithm 1, which calls Algorithm 2. In the description of Algorithm 1 below, the MSAP switching procedures are described in terms of signal position paths rather than in terms of MSAP switch settings. There are N signals, each of which has a D-dimensional starting position and a D-dimensional target position. Algorithm 1 determines the position paths for all the signals, giving each signal's position coordinates in each layer from L₀ to L_(2D−1) in a manner that is allowed by the MSAP topology. No two signals can occupy the same position in any layer. The L₀ position of each signal is its starting position. The L_(2D−1) position of each signal generated by Algorithm 1 will match the signal's target position.

[0044] Algorithm 1

[0045] Algorithm 1 calls itself recursively. Each time Algorithm 1 is called, a parameter Σ that represents a subset of the input signals is passed on to Algorithm 1. When Algorithm 1 is called initially, parameter Σ={all the signals} is passed to Algorithm 1. In subsequent recursive calls, parameter Σ will contain successively smaller subsets of signals. Algorithm 1 keeps track of the depth of its recursion by passing a level parameter λ. When Algorithm 1 is initially called, parameter λ=1 is given to the algorithm. When Algorithm 1 calls itself again, parameter λ=2 is passed, and so forth.

[0046] For any given N, D, ω₁, ω₂, . . . , and ω_(D) such that N=ω₁×ω₃× . . . ×ω_(D), Algorithm 1 performs the steps below using the parameter λ to represent the recursion depth and Σ to represent the subset of signals to operate on. Initially, set A=1.

[0047] Step 1: If λ=D, jump to step 5.

[0048] Step 2: Copy the L_(λ−1) position of each signal in Σ to its L_(λ) position.

[0049] Step 3: Call Algorithm 2 to permute the L_(λ) positions of signals in Σ, changing only the λ^(th) position coordinate of each signal, in such a way that no two signals in Σ with the same λ^(th) position coordinate have identical (λ+1)^(st) to D^(th) target coordinates.

[0050] Step 4: For each c, ranging from 1 to ω_(λ), recursively call Algorithm 1, passing parameters:

[0051] Σ=the subset of signals in Σ which have λ^(th) position coordinates equal to c in layer L_(λ); and

λ=λ+1.

[0052] Step 5: Permute the signals in Σ from their positions in L_(2D−1) to new positions in L_(2D−λ), by changing the λ^(th) position coordinate of each signal to its λ^(th) target coordinate.

[0053] The permutations performed in Steps 3 and 5 of Algorithm 1 permutes signals only along a particular dimension allowable for that layer. For example, the switches in layer L_(k) permute signals only along dimension k. Algorithm 1 also follows the rule that no two signals occupy the same position in a given layer. This rule can be ensured because Algorithm 2, as described below, moves signals by swapping pairs of signals, thereby preventing collisions of signals (i.e., two signals being selected by the same switch).

[0054] Algorithm 2

[0055] The following is a description of Algorithm 2 called by Step 3 of Algorithm 1. Due to the recursive nature of Algorithm 1, in layer L_(λ), all the signals in Σ will have the same 1^(st) to (λ−1)^(st) position coordinates. Their λ^(th) to D^(th) position coordinates, on the other hand, will run through all possible combinations. Thus, although each signal still has D position coordinates, Σ is effectively a (D−λ+1)-dimensional array of signals according to positions in layer L_(λ). The λ^(th) position coordinate of each signal in Σ determines the “slice” of this array that signal is in. The (λ+1)^(st) to D^(th) position coordinates of each signal determine the “column” of the array that the signal is in. The definitions of “slice” and “column” are given later. Two signals with the same λ^(th) position coordinates are in the same slice, and two signals with identical (λ+1)^(st) to D^(th) position coordinates are in the same column. Thus, the array of signals in Σ is divided into ω_(λ) different slices, numbered 1 through ω_(λ). The array is also divided into ω_(λ+1)×ω_(λ+2)× . . . ×ω_(D) different columns, not numbered. Step 3 of Algorithm 1 only changes the λ^(th) position coordinate of each signal so that a signal only moves within its column.

[0056] The following is a description of the terms “slice” and “column.” A slice is a subset of the available position coordinates containing subsets of Σ of signals that are processed by Algorithm 2. For example, consider an MSAP with D=5, ω₁=6, ω₂=5, ω₃→4, ω₄=3, and ω₅=2. In this case, a total of 6×5×4×3×2=720 signals are permuted, each of which has 5-dimensional starting position coordinates and 5-dimensional target position coordinates. For example, the signal starting in one position [1,1,1,1,1] may have a target position [6,5,4,3,2]. Suppose Algorithm 2 is called when λ=3, representing the ₃ ^(rd) level of recursion in Algorithm 1. In this case, Σ is a subset of signals with constant L₂ position coordinates in dimensions 1 and 2. For example, all the signals in Σ may have L₂ position coordinates of the form [4,2,_,_,_], so that Σ contains exactly 4×3×2=24 signals: [4,2,1,1,1], . . . ,[4,2,4,3,2].

[0057] In Step 2 of Algorithm 1, the L₂ positions of the signals in Σ are copied to their L₃ positions, so the L₃ positions of the 24 signals are also of the form [4,2,_,_,_]. Algorithm 2 is called to rearrange the 24 signals in Σ within layer L₃, i.e., Algorithm 2 changes the L₃ position coordinates of the 24 signals in Σ. Algorithm 2 divides the 24 signals in Σ into 4 “slices” and 6 “columns” based on their position coordinates. Each of the 4 slices have 6 signals, and each of the 6 columns have 4 signals.

[0058] The 4 slices contain those signals with a given 3^(rd) (since λ=3)L₃ position coordinate. There are 4 slices because λ=3 and w₃=4. The slices are called “slice 1” through “slice 4.” Slice k contains those signals in Σ whose 3^(rd) L₃ position coordinate is k. For example, slice 3 contains 6 signals whose L₃ position coordinates are [4,2,3,1,1], [4,2,3,1,2], [4,2,3,2,1], [4,2,3,2,2], [4,2,3,3,1], and [4,2,3,3,2]. During this call to Algorithm 2, the signals in slice 3 ha these six L₃ positions, although the particular signals occupying those positions may change.

[0059] When Algorithm 2 swaps two signals, those two signals will exchange slices, meaning that they will swap their 3 ^(rd) L₃-position coordinates. The 6 columns, meanwhile, contain hose signals with a given combination of 4 ^(th) and 5^(th L) ₃ position coordinates (the coordinates above λ=3). Thus, there is a column whose signals have L₃ position coordinates [4,2,1,1,1], [4,2,2,1,1], [4,2,3,1,1] and [4,2,4,1,1]. There is another column whose signals have L₃ position coordinates [4,2,1,3,2], [4,2,2,3,2], [4,2,3,3,2] and [4,2,4,3,2]. The term “column,” as used in this example, describes a set of L₃ position coordinates with their 4^(th) and 5^(th) dimension coordinates fixed. Thus, signals with L₃ position coordinates [4,2,2,3,1] and [4,2,4,3,1] are said to be in the “same column” because their 4^(th) and 5^(th) coordinates [3,1] match. In Algorithm 2, a column always retains the same set of signals after permutation because the swapping action only exchanges two signals in the same column.

[0060] Having described the definition of the terms “slice” and “column,” the following is a description of the steps performed by Algorithm 2:

[0061] 1. FOR S=1 TO ω_(λ)−1 DO:

[0062] 2. WHILE slice S contains at least 2 signals with identical (λ+1)st to D^(th) target coordinates DO:

[0063] 3. SET X₀=one of these signals

[0064] 4. FOR K=0,1, 2, . . . DO:

[0065] 5. Look for a signal Z in slices S+1 through ω_(λ) and in the same column as

[0066] X_(K) (selected in Step 3 or 6) with the following property: Z's (λ+1)st to D^(th) target coordinates are not identical to the (λ+1)^(st) to D^(th) target coordinates of any signal in slice S. IF found THEN BREAK from the “FOR K” loop.

[0067] 6. Select a pair of signals signal Y_(K+1) and X_(K+1) with the following properties:

[0068] Y_(K+1) is in slices S+1 through ω_(λ) and in the same column as X₀, X₁, . . . , or X_(K);

[0069] X_(K+1) is in slice S, and is different from X₀, X₁, . . . , and X_(K);

[0070] Y_(K+1) and X_(K+1) have identical (λ+1)^(st) to D^(th) target coordinates.

[0071] There will always be such a pair of signals.

[0072] 7. END FOR K

[0073] 8. LOOP:

[0074] 9. Swap signal Z with signal X_(K)

[0075] 10. IF K=0 THEN BREAK from the LOOP

[0076] 11. SETZ=Y_(K)

[0077] 12. Z is now in the same column as some X_(J). SET K to this J

[0078] 13. END LOOP

[0079] 14. END WHILE

[0080] 15. END FOR S

[0081] Algorithm 2 works by “fixing” one slice at a time, i.e., once a slice is fixed, its signals are not changed in successive calls to the algorithm. The WHILE condition of step 2 checks the current slice for precisely the property that Algorithm 2 is trying to eliminate. When this property is no longer found, that slice is fixed. A fairly small number of times through Steps 3-13 always fixes a slice.

[0082] The goal of Steps 3-13 is to move a signal with “duplicate” (λ+1)^(st) to D^(th) target coordinates out of layer S, replacing it with some other signal from above layer S whose (λ+1)^(st) to D^(th) target coordinates are missing from layer S. In the ideal case where these two signals are in the same column, Algorithm 2 just swaps the two signals. However, such a pair of signals may not be in the same column, so a chain of swaps are performed, each of which operates within a different column. The first swap operation moves a signal with missing (Xλ30 1)^(st) to D^(th) target coordinates into layer S, and the last swap operation moves a signal with duplicate (λ+1)^(st) to D^(th) target coordinates out of layer S.

[0083]FIG. 4A shows an example of a mapping between the input and output terminals of MSAP 300. FIG. 4B shows an example of how the position coordinates of the input signals of a 12-by-12 MSAP are changed from their starting position coordinates to their target position coordinates. The changes in position coordinates were determined using Algorithm 1 and Algorithm 2 above. Layer L₁ only changes the coordinates of the first dimension, layer L₂ only changes the coordinates of the second dimension, and layer L₃ only changes the coordinates of the first dimension.

[0084] An advantage of an MSAP is that it efficiently uses pre-selected switches as basic building blocks. The simplest and most efficient MSAP's use only n:1 switches (n being an integer) to permute N=n^(D) signals. In such a case, there are 2D- 1 layers, each containing N switches. Because D=log_(n)N, there is a total of N×(2 log_(n)N−1)n:1 switches. Therefore, the amount of logic circuitry required to construct an MSAP using pre-selected switches with fixed switch sizes is on the order of N log N.

[0085] A comparison of the efficiency of MSAP's constructed using switches of various sizes is given below in Table 1. Here, it is assumed that larger switches are built from smaller switches. Table 1 shows that an MSAP using larger switch sizes requires more logic circuitry. For example, if 2:1 switches are available, then building an N=64 MSAP is the most efficient using eleven 6-dimensional layers of 2:1 switches. Using five 3-dimensional layers of 4:1 switches requires 36% more logic circuitry. Using three 2-dimensional layers of 8:1 switches requires 91% more logic circuitry. Using one one-dimensional layer of 64:1 switches (effectively a set of parallel multiplexers) requires 473% more logic circuitry. TABLE 1 2:1 switches Equivalent Ratio n:1 switches Total n:1 comprising each total 2:1 vs. Difference n per layer Layers switches n:1 switch (= n − 1) switches n = 2 vs. n = 2 2 64 11 704 1 704 100%  0% 4 64 5 320 3 960 136%  36% 8 64 3 192 7 1344 191%  91% 64 64 1 64 63 4032 573% 473%

[0086] An application of an MSAP is to permute signals between a motherboard and a daughter board in a computer. Referring to FIG. 5A, a computer 500 includes a daughter board 506 coupled to a motherboard 502 through an interface 504 that has signals lines #1 to #12. The daughter board 506 includes a memory 512 that can be accessed by motherboard 502. Daughter board 506 and motherboard 502 are manufactured by different companies so that motherboard 502 sends signals on the signal lines of interface 504 in an order that is different from the order recognized by memory 512.

[0087] Daughter board 506 includes a permuting network 508 that has input terminals connected to interface 504 and output terminals connected to an interface 510, which in turn is connected to memory 512. Permuting network 508 routes the signals in the signal lines of interface 504 to the signal lines of interface 510 with a new ordering that is acceptable to memory 512. The mapping between signal lines of interface 504 and signal lines of interface 510 is shown in FIG. 5B. As an example, the signal s₁ is line #1 of interface 504 is mapped to line #7 of interface 510, and signal s12 on line #12 of interface 504 is mapped to line #4 of interface 510. Similarly, the signals sent from memory 512 are re-ordered by permuting network 508 and then passed on to motherboard 502.

[0088] Permuting network 508 is implemented using a field programmable logic array (FPGA). The FPGA includes configurable logic components, each of which can be programmed to receive a number of inputs and to pass any one of the inputs to its outputs, just like an n:1 switch. When computer 500 is first booted, daughter board 506 communicates with motherboard 502 to determine the required permutation. A processor (not shown) on daughter board 506 executes an MSAP switching algorithm to program the configurable logic components to construct an MSAP that achieves the required permutation.

[0089] In the example shown in FIG. 6, a 18-by- 18 MSAP 600 is constructed from four layers of 3:1 switches and one layer of 2:1 switches. In this example, N=18=3×3×2, D=3, ω₁=3, ω₁3, and w₃=3. MSAP 600 includes layers L₁ to L₅. Layers L₁, L₂, L₄, and L₅ include 3:1 switches that are configured to form 3-by-3 permuters. Layer L₃ includes 2:1 switches that are configured to form 2-by-2 permuters. Within each permuter, different switches select different input signals so that there is no overlap in their selections. This results in a re-ordering of the input signals appearing at the output terminals of the permuter.

[0090] In FIG. 6, the switch labels are shown inside the permuters. The three switches in a permuter 602 in layer L₁ have the same 2 ^(nd) and 3 ^(rd) coordinates but different 1^(st) coordinates. Thus, permuter 602 permutes the position coordinates of input signals in the first dimension. Likewise, other permuters in layer L₁ also permute the position coordinates of input signals in the first dimension. The three switches in a permuter 604 in layer L₂ have the same 1^(st) and 3^(rd) coordinates but different 2^(nd) coordinates. Thus, permuter 604 permutes the position coordinates of input signals in the second dimension. Likewise, other permuters in layer L₂ also permute the position coordinates of input signals in the second dimension.

[0091] In a similar manner, permuters in layer L₃ permute the position coordinates of input signals in the third dimension. Permuters in layer L₄ permute the position coordinates of input signals in the second dimension. Permuters in layer L₅ permute the position coordinates of input signals in the first dimension. Each of the output signals of a permuter in layer L₁ is sent to a different permuter in layer L₂, each of the output signals of a permuter in layer L₂ is sent to a different permuter in layer L₃, and so forth.

[0092] MSAP 600 is designed to allow any of the 18 input signals to be routed to any of the 18 output terminals. In operation, the MSAP switching algorithm is performed to determine the transition paths for the 18 input signals. The transition paths go through the five layers of switches in a way such that each signal is routed to the desired output terminal and that no two signals occupy the same switch. The switches are then programmed to select input signals according to the signal paths determined by the MSAP switching algorithm.

[0093] Other embodiments are within the scope of the following claims. For example, the switches may be implemented using devices other than an FPGA. The switches may be programmed by a processor on the motherboard rather than by a processor on the daughter board. The configuration of switches (i.e., which signal is selected from its inputs and reflected at the output) may be dynamically programmable, or be programmable once and then fixed permanently. An MSAP that performs bit permutations can be used as a building block of an encoder/decoder that encrypts/decrypts messages. The MSAP can be used to construct telecommunication networks to route signals from nodes at one location to nodes at another location in a non-blocking manner. The MSAP can also be used in massively parallel or supercomputing applications to route signals among different processors. 

What is claimed is:
 1. A method comprising: selecting a configuration for layers of a permuting network based on a set of integer factors of N, the number of signals to be permuted, and on pre-selected types of switches; and constructing the permuting network in layers by using the pre-selected types of switches based on the selected configuration.
 2. The method of claim 1 in which each of the types of switches is capable of selecting one signal from among a number of signals, the number being different for different types of switches.
 3. The method of claim 2 in which each of the integer factors corresponds to the number of signals that one type of switches can select from.
 4. The method of claim 1, further comprising selecting the set of integer factors ω₁, ω₂ . . . , ω_(D) (D being an integer) such that N=ω₁×ω₂× . . . , >ω_(S).
 5. The method of claim 4 in which the permuting network is configured to have 2D−1 layers of switches, the switches including ω₁:1, ω₂:1, . . . , and ω_(D):1 switches or are constructed from ω₁:1, ω₂:1, . . . , and ω_(D):1 switches.
 6. A method comprising: receiving N signals; and re-ordering the N signals using a permuting network constructed from layers of switches having a configuration based on a set of integer factors of N and on pre-selected types of switches.
 7. The method of claim 6 in which each layer has N switches of the same type, each type of switch having a predefined number of input terminals and one output terminal.
 8. The method of claim 7 in which each layer of the permuting network groups the N signals into subsets of signals and permutes the ordering of the subsets of signals, the number of signals in the subsets being equal to the number of input terminals that each switch in the layer has.
 9. The method of claim 7 in which the permuting network is constructed by assigning multi-dimensional coordinates to the switches, each switch in a layer having a different coordinate, and configuring the switches so that when a signal propagates from a first switch in one layer to a second switch in the next layer, the coordinates of the two switches differ in one dimension only.
 10. The method of claim 6 in which the integer factors are ω₁,ω₂, . . . , ω_(D) (D being an integer) such that N=ω₁×ω₂× . . . ×ω_(D), and the pre-selected types of switches include ω₁1, ω₂:1, . . . , and ω_(D):1 switches.
 11. Apparatus comprising: N input terminals, N being an integer; N output terminals; and a permuting network configured to form non-blocking signal paths that connects the input terminals to the output terminals in an arbitrary order, the permuting network constructed from layers of switches of different types, each layer having the same number of switches of the same type, each type of switch capable of selecting one signal from among a predefined number of signals.
 12. The apparatus of claim 11 wherein each switch has input and output terminals, the input terminals of the switches in the first layer coupled to the N input terminals of the apparatus, the output terminals of the switches in the last layer coupled to the N output terminals of the apparatus, and for all layers except the last layer, the output terminals of the switches are connected to the input terminals of the switches in the next layer.
 13. The apparatus of claim 12 in which the number of layers and the connection between switches of successive layers are based on a set of integer factors of N and on the types of switches used.
 14. The apparatus of claim 13 in which the integer factors are ω₁,ω₂, . . . , ω_(D) (D being an integer) such that N=ω₁×ω₂× . . . ×ω_(D), and the types of switches used include ω₁:1, ω₂:1, . . . , and ω_(D):1 switches.
 15. The apparatus of claim 14 in which the permuting network is configured to have 2D−1 layers of switches, each layer permuting the order of different subsets of signal paths.
 16. The apparatus of claim 15 wherein for each p-th layer of switches, p ranging from 1 to D, ω_(p):1 switches are configured to form ω_(p)-by-ω_(p) permuters that are capable of permuting the ordering of ω_(p) signal paths, and for each of the q-th layer of switches, q ranging from D+1 to 2D−1, ω_(2D−q):1 switches are configured to form w_(2D−q)-by-ω_(2D−q) permuters that are capable of permuting the ordering of ω_(2D−q) signal paths.
 17. The apparatus of claim 16 in which each of the input terminals of each permuter in the 2^(nd) layer to the (2D−1 )th layer is connected to the output terminal of a different permuter in the previous layer.
 18. Apparatus comprising: a first device configured to generate N signals having a first ordering; a second device configured to accept the N signals arranged in a second ordering; and a permuting network configured to receive the N signals having the first ordering and re-order the N signals so that the N signals have the second ordering acceptable by the second device, the permuting network constructed from layers of switches of different types, each layer having the same number of switches of the same type, each type of switch capable of selecting one signal from among a predefined number of signals.
 19. The apparatus of claim 18 in which the number of layers and the connection between switches of successive layers are based on a set of integer factors of N and on the types of switches used.
 20. The apparatus of claim 19 in which the second device is a memory.
 21. The apparatus of claim 20 in which the first device is a computer motherboard.
 22. The apparatus of claim 19 in which the integer factors are ω₁, ω₂, . . . , ω_(D) (D being an integer) such that N=ω₁×ω₂× . . . ×ω_(D), and the types of switches used include ω₁:1, ω₂:1 and ω_(D):1 switches.
 23. The apparatus of claim 22 in which the permuting network is configured to have 2D−1 layers of switches, each layer having N switches of the same type, each layer permuting the order of different subsets of the N signals.
 24. A computer program stored on a computer-readable media for causing a computer system to perform the functions of: assigning a multi-dimensional coordinate to each of a set of N signals, N being an integer; in successive operations, changing the coordinates of the N signals for a particular dimension during each operation, such that no two signals have the same coordinates after each operation, so that after the successive operations, the coordinates of the N signals match a set of target coordinates.
 25. The computer program of claim 24, further causing the computer system to perform the function of finding integer factors ω₁, ω₂, . . . , and ω_(D) of N such that N=ω₁×ω₂× . . . ×ω_(D), D being an integer.
 26. The computer program of claim 25 in which the multi-dimensional coordinates are [x₁, x₂, . . . , x_(D)], x_(k) ranging from 1 to ω_(k) for each k, k ranging from 1 to D.
 27. The computer program of claim 26 in which the coordinates of the N signals are changed by swapping the coordinates of a pair of signals.
 28. Apparatus for re-ordering N signals, comprising: (2D−1) layers of switches, D being an integer, the n-th layer and (2D−n)-th layer having ω_(n)-by-ω_(n), switches, n ranging from 1 to D, and ω₁, to ω_(D) being integer factors of N such that N=ω₁×ω₂× . . . ×ω_(D), the first layer of switches re-ordering the order of N signals to generate a first set of re-ordered signals, the i-th layer of switches re-ordering the (i-i )-th set of re-ordered signals to generate an i-th set of re-ordered signals, i ranging from 2 to 2D−1, the (2D−1)-th set of re-ordered signals matching a target ordering of N signals.
 29. The apparatus of claim 28 in which each of the N signals are assigned D-dimensional coordinates, the n-th coordinate ranging from 1 to ω_(n), the p-th layer switches configured to swap signals that differ only in the p-th coordinates with the coordinates in other dimensions being the same, p ranging from 1 to D, and the q-th layer switches configured to swap signals that differ only in the (2D−q)-th coordinates with the coordinates in other dimensions being the same, q ranging from D+1 to 2D−1. 